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Dough development is visualised as a re-orientation of glutenin 
polymers to form a membrane network with viscoelasticity and gas retaining 
properties. Covalent (disulfide) and noncovalent (hydrogen, hydrophobic 
and ionic) bonds are involved in formation of a fully developed dough. 
Interactions are further modified during fermentation, baking and even after 
baking. The disulphide bonds of flour proteins play a key role in the 
interactions in dough. The bonds form relatively strong crosslinks within 
and between polypeptide chains and also stabilise other less energetic bonds. 
Disulphide bonds provide the required stability for the protein matrix until 
the loaf structure is set by the gelatinisation of the starch and the thermal 
denaturation of the proteins during baking. Hydrogen bonds are considerably 
weaker than covalent bonds, but contribute significantly to the structure of 
dough. A unique feature is the ability to interchange with other hydrogen 
bonds, which facilitate reorientation of protein chains and allow for stress 
relaxation. Hydrophobic bonds result from nonpolar groups of flour 
constituents. Because these bonds are reversible, they can readily 
accommodate viscous flow and thereby facilitate mechanical dough 
development. Ionic bonds play relatively small part in dough structure 
formation but some specific components have an ionisable part or parts. 
Therefore ionic bond interactions could be important for the rheological 
properties (for a review, see Bushuk 1998). 

A major limitation to evaluating the contributions of various groups of 
proteins, and of specific structural features of these molecules, to dough 
functionality has been the lack of appropriate systems that allow specific 
proteins to be incorporated and tested within the dough. The situation has 
recently changed, however, due to two advances. The first is the 
development of small scale testing equipment (Mixograph, Extensograph) 
with appropriate procedures for incorporating exogenous proteins, including 
polymeric glutenins into the dough (Bekes et al., 1994). Advantages of this 
system are the small amount of proteins required for test and the ability to 
rapidly test multiple samples produced by, for example, protein engineering. 
The second recent advance is the development of a reliable transformation 
system for wheat (Weeks et al., 1993, Witrzens et al., 1998), which allows the 
modification of storage protein composition by the expression of new 
proteins with, for instance, designed characteristics. 
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To alter protein-protein, protein-lipid and protein-starch interactions 
within the gluten matrix, the present inventors have developed a system 
which enables the incorporation of new surface active molecules or parts of 
molecules into the gluten matrix. 
Disclosure of Invention 

In a first aspect, the present invention consists in a method of 
producing a modified glutenin or seed-storage protein, the method 
comprising adding to the protein a domain which confers to the modified 
protein the ability to incorporate into gluten or bind a ligand or other 
macro molecule. 

In one preferred embodiment, the modified glutenin or seed-storage 
protein contains one or more amino acid residues added to its amino acid 
sequence. More preferably, the one or more amino acid residues are one or 
more cysteine residues. Preferably, the one or more cysteine residues are 
incorporated at one or both ends of the amino acid sequence of the protein. 
The addition of the one or more cysteines allows the modified proteins to be 
more easily incorporated into gluten in use. The modifications to the 
glutenin or seed-storage proteins produced according to the present invention 
allow the incorporation of that protein into the gluten network for food or 
industrial use. 

The present inventors have found that incorporating exogenous amino 
acid sequences (domains) from proteins other than glutenins into glutenin or 
seed-storage proteins modifies the general properties of gluten when the 
proteins are used in a range of food applications. 

Figure 15 provides a schematic of the scheme for identifying 
transgenes in transformed wheat plants by polymerase chain reaction (PCR). 
A primer pair straddling the interface between the gene and its promoter 
(from the gene for the high molecular weight glutenin Bxl7 for example) 
ensures that no false positives are detected arising from the high homology 
between C hordein and gliadin genes. 

In a further preferred embodiment, the domain is a binding domain 
that confers to the modified protein the ability to bind a ligand or other 
macromolecule. 

The binding domain can be any domain that will bind ligands that may 
be useful in food preparation or in food compositions. In a preferred form, 
the binding domain is a ligand capable of binding lipids or starches. The 
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present inventors have found that the lipid-binding domain of the barley 
oleosin gene, the lipid-binding regions of the wheat CM16 protein, and the 
starch-binding domain of the glucoamylase from./\spergi//us niger are 
particularly suitable for the present invention. It will be appreciated, 
5 however, that other natural or modified domains would also be suitable for 
the present invention. 

One glutenin or seed-storage protein that has been modified by the 
present inventors is the C hordein gene from barley. It will be appreciated, 
however, that other glutenin or seed-storage proteins may also be modified 

10 according to the present invention. In wheat, such glutenin or seed proteins 
include low molecular weight glutenins, high molecular weight glutenins, 
gliadins, puroindolines or grain softness proteins (also known as friabilins), 
or Chloroform/Methanol-soluble proteins. Homologues of these proteins 
exist in other cereals such as diploid, tetraploid and hexaploid wheats, rye, 

15 triticale, barley, oats, rice, sorghum, millet and maize and the genes encoding 
these proteins may also be modified according to the present invention. 

In a second aspect, the present invention consists in a modified 
glutenin or seed-storage protein having a domain inserted therein which 
confers to the protein the ability to incorporate into gluten or bind a ligand or 

20 other macromolecule. 

In one preferred embodiment, the modified glutenin or seed-storage 
protein is produced by the method according to the first aspect of the present 
invention. 

In another preferred embodiment, the modified glutenin or seed- 
25 storage protein is ANG/SBD/Cys7Cys236, ANG/OHBD/Cys7Cys236 or 
ANG/CMl6/Cys7Cys236. 

In a third aspect, the present invention consists in an isolated nucleic 
acid molecule encoding a modified glutenin or seed-storage protein according 
to the second aspect of the present invention. 
30 In a fourth aspect, the present invention consists in an isolated nucleic 

acid molecule according to the third aspect of the present invention 
incorporated into a cell such that on expression of the nucleic acid molecule, 
the cell produces the modified glutenin or seed-storage protein. 

The cell may be a recombinant bacterial cell, for example, which is 
35 capable of producing the modified glutenin or seed-storage protein. 

Preferably the bacterial cell is Escherichia coli. Alternatively, the cell may be 
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a yeast such as Pichia sp. or Saccharomyces cerevisiae, an insect cell using an 
expression system such as the baculovirus expression system, or a 
mammalian cell. Alternatively, the cell may be a plant cell of a recombinant 
plant which is capable of producing the modified glutenin or seed-storage 
5 protein in the plant's seeds. Preferably the plant cell is a recombinant wheat 
cell. 

In fifth aspect, the present invention consists in the use of a modified 
glutenin or seed-storage protein according to the second aspect of the present 
invention in the preparation of a food product. 
10 Examples of food products include leavened or unleavened breads, 

pasta, noodles, breakfast cereals, snack foods, cakes, pastries or other foods 
containing flour-based sauces or ingredients. 

The modified glutenin or seed-storage proteins according to the present 
invention, in use, are capable of modifying the structure of doughs and other 
15 materials containing gluten in ways which add value and utility to the 
resultant product. The modified glutenin or seed-storage proteins are 
suitable for use in the food industry as modifiers of food properties. 

The present inventors have shown that modified proteins according to 
the present invention can be produced in bacterial fermentation and that 
20 large scale production of the proteins for commercial use is possible. 

In sixth aspect, the present invention consists in the use of a modified 
glutenin or seed-storage protein according to the second aspect of the present 
invention in the preparation of a non-food product. 

Examples of non-food products include, but not limited to, films, 
25 coatings, adhesives, building materials or packaging materials. It will be 
appreciated that the modified proteins according to the present invention 
would have the same non-food uses as for normal glutenin or seed-storage 
proteins. 

In a seventh aspect, the present invention consists in the use of a grain 
30 or part of a grain containing a modified glutenin or seed-storage protein 

according to the second aspect of the present invention in the preparation of 
a food product. 

It will be appreciated that the modified glutenin or seed-storage 
proteins according to the present invention may be contained in, or produced 
35 by, a transgenic plant produced by the fourth aspect of the present invention. 
Thus, the grain or other plant products produced by these plants may be used 
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in the crude form of flour, semolina, bran, pollard, germ fraction, or the like. 
Such plants may produce a mixture of normal and modified glutenin or seed- 
storage proteins which can also be used. Mixtures of modified glutenin or 
seed-storage proteins and other plant-based materials can also be prepared to 
5 form improved gluten. 

Throughout this specification, unless the context requires otherwise, 
the word "comprise", or variations such as "comprises" or "comprising", will 
be understood to imply the inclusion of a stated element, integer or step, or 
group of elements, integers or steps, but not the exclusion of any other 
10 element, integer or step, or group of elements, integers or steps. 

In order that the present invention may be more clearly understood, 
preferred forms will be described with reference to the following examples 
and drawings. 

Brief Description of Drawings 

15 Figure 1: Sequence of the pJANGACys7Cys236 vector (SEQ ID NO: 14 

and SEQ ID NO: 15). Restriction sites and enzymes of the MCS of pJKKm 
and pGEM-T plasmids are written in bold. Cloning sites for gene insertion 
into ANG are underlined and written in bold. Cleavage sites for sub-cloning 
the gene into pET lid expression vector are underlined. 

20 Figure 2: Nucleotide (SEQ ID NO: 16) and amino acid sequence (SEQ 

ID NO: 17) of Oleosin Hydrophobic Binding Domain (OHBD). Arrowheads 
are indicating the direction of primer extension. Smaller letters are 
indicating extra nucleotides for cloning and amino acids from ANG molecule. 
Amino acid sequence of the central anti-parallel domain is designated by two 

25 arrows. 

Figure 3: Nucleotide (SEQ ED NO: 18) and amino acid sequence (SEQ 
ID NO: 19) of ANGACys7Cys236 (molecular mass 18.5 kDa) Cysteine residues 
are marked with bold letters. Extra two amino acids and six nucleotides are 
written with smaller letters. 

30 Figure 4: Nucleotide (SEQ ID NO: 20) and amino acid sequence (SEQ 

ID NO: 21) of the starch binding domain of 1,4-a-D-glucan glucohydrolase 
from Aspergillus niger (molecular mass 11.9 kDa). Small letters are indicating 
extra nucleotides for cloning and amino acids from ANG molecule. 
Restriction sites are in bold and underlined. 

35 Figure 5: Nucleotide (SEQ ID NO: 22) and amino acid sequences of 

CM16 (SEQ ID NO: 23) and CM17 (SEQ ID NO: 24) (molecular mass is 13.4 
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kDa]. Small letters are indicating extra nucleotides for cloning and amino 
acids from ANG molecule. Restriction sites are in bold and underlined. First 
amino acid sequence under the nucleotide sequence represents the CM16 
protein, while only differences are shown in CM17 protein. 

Figure 6: Nucleotide (SEQ ID NO: 25) and amino acid sequence (SEQ 
ID NO: 26) of Puroindoline A (molecular mass is 14.3 kDa). Small letters are 
indicating extra nucleotides for cloning and amino acids from ANG molecule. 
Restriction sites are in bold and underlined. 

Figure 7: Expression of the ANG/OHBD/Cys7Cys236 gene. SDS-PAGE 
analysis followed by Coomassie blue staining. Lane A contains standard 
molecular weight markers. Lane B shows a control in which the host E. coli 
cell contains the pET-lld vector. Lane C shows the expression of the 
recombinant ANG/OHBD/Cys7Cys236 protein (migrating slightly further than 
the 30 kDa marker) from pET-lld containing the ANG/OHBD/Cys7Cys236 
gene. 

Figure 8: Expression of the ANG/OHBD/Cys7Cys236 gene. SDS-PAGE 
analysis followed by Western blot using antibodies to C-hordein. Lane A 
contained standard molecular mass markers. Lane B contained extract of an 
E. coli cell expressing a modified C-hordein gene containing a single cysteine 
in the N-terminal region. Lane C contained extract from E. coli containing 
the ANG/OHBD/Cys7Cys236 gene in the pET-lld expression plasmid. 

Figure 9: SDS-PAGE analysis of the purified ANG/CMl6/Cys7Cys236 
gene product by Coomassie blue staining. Lane A contains standard protein 
molecular weight markers. Lanes B to E show ethanol-soluble extracts of the 
crude E. coli lysates. Lane B contained extract from cells containing the 
control plasmid, pET-lld. Lanes C, D and E contained ethanol soluble 
extracts of cells harbouring the pET-lld vector containing the 
ANG/CMl6/Cys7Cys236 gene, prepared from cells 2, 4 and 6 hours, 
respectively, after induction of protein synthesis using IPTG. 

Figure 10: Expression of the ANG/PIN-A/Cys7Cys236 gene. SDS-PAGE 
gel stained with Coomassie blue. Lane A contains standard protein 
molecular weight markers. Lane B contained extract of cells harbouring the 
control pET-lld plasmid, lanes C and D contained extracts of cells 
harbouring the pET-lld vector containing the ANG/PIN-A/Cys7Cys236 gene. 
Lane C contains extract 2 hours after IPTG induction, lane D contains extract 
prepared 6 hours following IPTG inductioa 
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Figure 11: Expression of the ANG/PIN-A/Cys7Cys236 gene. SDS-PAGE 
analysis followed by Western blotting with anti-puroindoline A antibodies. 
Lanes A and B contained extract of cells harbouring pET-lld containing the 
ANG/PIN-A/Cys7Cys236 gene, 2 hours and 6 hours after induction 
5 respectively. Lane C contains a Western blot of cells harbouring a plasmid 
containing the puroindoline A gene alone (not inserted into the 
ANGACys7Cys236 gene). Lane D contained extract of cells harbouring the 
control pET-lld vector. 

Figure 12: Expression of the ANG/PIN-A/Cys7Cys236 gene. SDS-PAGE 

10 analysis followed by Western blotting with anti-hordein antibodies. Lane A 
contains molecular weight markers. Lane B contained extract of cells 
harbouring the control pET-lld vector. Lane C contained extract of cells 
harbouring a plasmid which contains the puroindoline A gene alone. Lane D 
contained extract of cells harbouring pET-lld vector containing the 

1 5 ANG/PIN-A/Cys 7Cys 2 36 gene . 

Figure 13: Plasmids for the transformation of wheat with constructs 
containing novel protein genes. The vectors are based on the plasmid 
pTLZHMWcas which has been modified by insertion of genes into the Kpnl 
and BamHl site region. The vectors are: pTLZ-ANGCys7Cysl3, pTLZ- 

20 ANGCys236 and pTLZ_ANG/OHBD/Cys7Cys236. 

Figure 14: Expression and P-cyclodextrin affinity purification of 
ANG/SBD/Cys7Cys236. Native- and SDS-PAGE gels of column fractions from 
a 5 ml p-cyclodextrin-sepharose affinity column loaded with extracts from 
IPTG-induced cultures of strain AD494(DE3) bearing either pETlld control 

25 plasmid or the pET-SBD plasmid for the expression of ANG/SBD/Cys7Cys236. 
Panel A = SDS-PAGE of fractions from AD494(DE3)/pET-SBD. 
Panel B = native PAGE of fractions from AD494(DE3)/pET-SBD. 
Panel C = SDS-PAGE effractions from AD494(DE3)/pETlld. 
Panel D = native PAGE of fractions from AD494(DE3)/pETlld. 

30 Lane M = molecular mass markers with masses indicated in kDa. 
Lane 1 = the unbound flow-through from column loading. 
Lanes 2-5 = four 5 ml washes with column buffer (50 mM Tris-Cl pH 7.5, 100 
mM NaCl, 40 mM dithiothreitol, 1 mM EDTA). 

Lanes 6-10 = five 5 ml elutions with column buffer containing 15 mM 0- 
35 cyclodextrin. 
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Figure 15: Schematic of the method of producing a modified seed 
storage protein containing a binding domain for a macromolecule. In this 
example, the C-hordein gene is altered to produce an analogue glutenin 
(ANG) protein containing cysteine residues in the N- and/or C-terminal 
5 domains. This protein is then further modified by replacement of part of its 
repetitive domain with a binding domain from another protein. The addition 
of cysteine residues allows for the incorporation of the protein into, for 
example, the gluten macropolymer in wheat flour. The addition of a specific 
binding domain provides for the non-covalent linkage of the gluten network 

10 to other macromolecules (e.g. starch, lipids). Such a protein may be 

expressed in recombinant bacteria, yeast, or other heterologous host for use 
as an ingredient or additive, or it may be produced- in the seed of a transgenic 
plant, for example wheat. 

Figure 16: Schematic of the scheme for identifying transgenes in 

15 transformed wheat plants by polymerase chain reaction (PCR). A primer pair 
straddling the interface between the gene and its promoter (from the gene for 
the high molecular weight glutenin Bxl7) ensures that no false positives are 
detected arising from the high homology between C hordein and gliadin 
genes. 

20 Figure 17: Agarose gel showing PCR-based identification of a 

transgenic wheat plant containing a gene for a modified seed storage protein 
with incorporated ligand-binding domain. In this example, the gene for 
ANG/OHBD/Cys7Cys236 was detected in the plasmid used for transforming 
the wheat plant (lane 1) and in a putative transformant wheat plant (lane 2), 

25 but is not detected in a non-transformed control wheat plant (lane 3). 

Figure 18: Starch-binding activity of ANG/SBD/Cys7Cys236 protein 
assayed by native-PAGE. Protein from each of fraction 6 and 7 from the J3- 
cyclodextrin affinity column of the urea extract of strain AD494(DE3)/pET- 
SBD was loaded onto native gels containing starch (panels B and D) and/or p- 

30 cyclodextrin (panels C and D) or neither starch nor p-cyclodextrin (panel A). 
The reduction in electrophoretic mobility in the presence of starch alone 
(panel B) is indicative of the specific starch-binding activity of each protein. 

Figure 19: Demonstration of the oxidative polymerisation of a 
modified seed-storage protein containing a macromolecule-binding domain. 

35 In this example, ANG/SBD/Cys7Cys236 was oxidised to form disulphide- 
linked polymeric species (indicated by arrows in the scanned image of the 
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SDS-PAGE gel; lanes 1, 2) in the presence or absence of added P- 
cyclodextrin. The polymeric species were capable of being subsequently 
reduced by the disulphide-specific reductant dithiothreitol (DTT; lanes 3, 4), 
showing that the polymerisation was due to the presence of introduced 
5 cysteine residues at the N- and C-termini of the ANG portion of the molecule. 
M = molecular weight markers with masses indicated (kDa); 
Lane 1 = ANG/SBD/Cys7Cys236 oxidised in the presence of p-cyclodextrin; 
Lane 2 = ANG/SBD/Cys7Cys236 oxidised in the absence of (3-cyclodextrin; 
Lane 3 = ANG/SBD/Cys7Cys236 oxidised in the presence of 3-cyclodextrin 
10 and then reduced with DTT; 

Lane 4 = ANG/SBD/Cys7Cys236 oxidised in the absence of p-cyclodextrin 
and then reduced with DTT; 

Lane 5 = reduced ANG/SBD/Cys7Cys236 in the presence of P-cyclodextrin; 

Lane 6 = reduced ANG/SBD/Cys7Cys236 in the absence of P-cyclodextrin. 
15 Modes for Carrying Out the Invention 

MATERIALS AND METHODS 

Bacterial strains and plasmids 

Escherichia coli strain DH5a was used as cloning host strain and E. coli 

strain AD494(DE3) (Novagen) was used as the expression host in this work. 
20 pGEM-T (Promega) was used as cloning vector of PCR products. 

pJKKm(-) (Kirschman and Cramer, 1988) served as cloning vehicle for fusion 

protein genes. New, assembled genes for expression were sub-cloned into 

plasmid pET-lld (Novagen). 

Cloning of ANGACys7Cys236 
25 Restriction endonucleases and DNA modifying enzymes were from 

New England Biolabs and Promega Corp. Other chemicals and reagents were 

of analytical reagent grade. 

Oligonucleotide primers were synthesised on an Applied Biosystem 394 
DNA/RNA Synthesiser using standard phosphoramidite chemistry and were 

30 deprotected by heating in ammonium hydroxide solution. Primers were 
lyophilised and dissolved in TE buffer. 

To amplify up a 477 bp long fragment (approximately 2/3 of the whole 
gene) of C hordein (Accession X60037) from the genomic clone (Lambdahorl- 
17) of barley by polymerase chain reaction two specially designed primers 

35 were used. 
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oligonucleotide 1 (SEQ ID NO: 1): 

Met Arg Cys 
5' G TC ATG A GG CAA CTA AAC CCT TGC AGC CAA GAG TTG CAA TC 
BspH I 3' 
5 oligonucleotide 2 (SEQ ID NO: 2) : 

*** Val Cys . Gin Gin Pro 

5' GGA TCC CTA GAC CAT ACT C CA TAT G CA TG A AGC TT G TTG GGG 

BamH I Nde I Hindlll 

GAC TGG TTG 3' 

10 The reaction was performed in an FTS-4000 Thermal Sequencer 

(Corbett Research, Australia), with 1 cycle of 3 min at 94°C, 20 s at 55°C, 1 
min at 72°C; 36 cycles of 30 s at 94°C, 1 min 30 s at 72°C. The reaction was 
carried out in 50 ul, containing 45 ul of Supermix (BRL Life Technology) and 
1 ng of template DNA and 50 pmol of each oligonucleotides in 5 (il. The 

15 DNA was purified following the QIAquick protocol (QIAGEN) and cloned, 
using the pGEM-T Vector System I (Promega) as was recommended by the 
manufacturer. White transformant colonies selected for growth in LB 
medium supplemented with ampicillin (100 mg/1) were screened for insert- 
bearing plasmid DNA by PCR, Plasmid DNA was purified from positive 

20 clones using Jetstar miniprep columns (Genomed) and the insert was 
sequenced in both direction, using the Prism dye terminator cycle 
sequencing protocol (Perkin-Elmer). One plasmid isolate, containing the 
ANG gene, was designated as pGEM-ANG. 

This plasmid contained two Ndel restriction sites, one within the gene 

25 and one within the pGEM-T multiple cloning site (MCS). To make it unique, 
the ANG gene was subcloned into pJKKm. Because pJKKm has a HindlE site 
in MCS, it had to be deleted before subcloning. 

One ug of the plasmid was digested with 5 units of Hindm enzyme in 
20 ul. Mung Bean Nuclease was used to remove the 5' overhang. Two ul of 

30 10X ZnS04 solution and 0.02 units of enzyme was added to the reaction mix 
and incubated at 30°C. Reaction mix was extracted once with 
phenol/chloroform after one hour incubation. DNA was recovered with 
ethanol precipitation and resuspended in 50 ul. An aliquot of 10 ul of the 
DNA solution was used for ligation. Ligation was carried out in 15 ul 

35 solution at 4°C overnight using T4 DNA ligase. The ligation mixture was 

used to transform competent cells of E. coli strain DH5a by electroporation. 
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Colonies were grown in LB medium supplemented with kanamycin (50 mg/1). 
Plasmid was purified from three colonies as mentioned above. DNA samples 
were tested by performing Hindin enzyme digestions. One clone not 
containing a Hindlll site was chosen and designated as pJKK-H. 



BamHI fragment into the plasmid. Both digested DNA were purified on 
QIAquick columns as described earlier, and ligated in 10 ul solution 
overnight at 14°C using T4 DNA ligase at a molar ratio of 50:1. Transformed 
E. coli competent cells was spread onto LB medium with kanamycin. 

10 Colonies were tested for insert-bearing plasmid DNA by PCR and 3 positive 
clones were sequenced as above. One clone was designated as 
pJANGACys7Cys236 and used for further cloning work to assemble genes for 
fusion proteins. Nucleotide and amino acid sequences of this cloning vehicle 
are shown in Figure 1. 

15 Design and cloning of oleosin hydrophobic binding domain 

The sequence of oleosin hydrophobic binding domain (OHBD) was 
designed to include the consensus sequence of three (maize, rice and barley) 
oleosin proteins. The sequence is almost identical to the sequence published 
for barley oleosin isoform-2 (Aalen 1995, Accession Number X82678). Four 

20 primers were designed to encode the protein (Figure 2). The gene was 

constructed by a modification of the technique of overlap extension, where 
the two long partially overlapping oligonucleotides were further extended 
and amplified by shorter external primers, encoding restriction enzyme 
cleavage sites to clone into pJANGACys7Cys236. PCR amplification was 

25 performed as above, using the same cycle program and Supermix solution. 
Concentration of the long oligonucleotides were 0.1 nM, while 2 uM of the 
short primers in the PCR reaction mix. The PCR product was purified on 
QIAquick column and cloned into pGEM-T plasmid using Promega kit, as 
described above. Three positive clones were used for plasmid preparation 

30 and sequencing to confirm the nucleotide sequence. One clone was used for 
further work and called pGEM-OHBD. 
Cloning of starch binding domain 

The DNA corresponding to the starch binding domain (SBD) of 
Glucoamylase 1 (1,4-a-D-glucan glucohydrolase) of Aspergillus niger 

35 (Accession number: X00548) was amplified by PCR from purified genomic 



5 



pJKK-H was cut with SphI and BamHI to subclone pGEM-ANG SphI- 
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DNA. Primers were designed to allow us to add tails at both ends of the DNA 
fragment for cloning into pJANGACys7Cys236 vector. 
Sequence of the PCR primers are: 
oligonucleotide 7 (SEQID NO: 3): 
5 Gin Ala Cys Thr 

5' C AA GCT T GT ACC ACT CCC ACC GCC 3' 

Hind III 
oligonucleotide 8 (SEQ ID NO: 4): 
lie Cys Arg 

10 5' C CA TAT G CA CCG CCA GGT GTC AGT CAC 3' 
Nde I 

Amplification, cloning into pGEM-T and sequencing was done as 
described above. One clone bearing the gene fragment was designated as 
pGEM-SBD. 

15 Cloning of CM16 and CM17 genes for fusion 

Both genes were amplified from purified wheat (Triticum aestivum) 
genomic DNA by PCR. 
nucleotide 9 (SEQ ID NO: 5): 
5' GTC GGC AAT GAA GAT TGC ACC 3* 
20 nucleotide 10 (SEQ ID NO: 6): 

5' TCC AAC TGC GTT CTC CTC TTG GCC 3' 

nucleotide 11 (SEQ ED NO: 7): 

5' GGA TCC CTA GCT CCA CTG AGA CTC 3* 

For CM16 gene (accession number X55455) oligonucleotide 10 and 11, 
25 while for CM17 gene (accession number X59791) 9 and 11 pairs were used. 
Clones were called pGEM-CMl6 and pGEM-CMl7, respectively. For sub- 
cloning into pJANGACys7Cys236 vector, the genes were PCR amplified again, 
using pGEM clones as template, purified on QIAquick column and digested 
with the appropriate enzymes. Primers used in this amplification were: 
30 nucleotide 12 (SEQ ID NO: 8): 

Gin Ala Leu Gly 
5* TGC GCT C AA GCT T TA GGC AAT GAA GAT TGC ACC 3' 
Hind III 
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nucleotide 13 (SEQ ID NO: 9): 

He Cys Ser 

5" CAT ACT C CA TAT G CA GCT CCA CTG AGA CTC 3' 
Nde I 

5 Cloning puroindoline A gene for fusion 

Lambda genomic clone for puroindoline A (PDSf-A), kindly provided by 
Sadequer Rahman, was used as template to amplify the gene (accession 
number X69913) by PCR. Primers were designed as follows; 
oligonucleotide 14 (SEQ ID NO: 10): 
10 Gin Ala Tyr 

5' C AA GCT TA C GAT GTT GCT GGC GGG 3' 

Hind III 
oligonucleotide 15 (SEQ ID NO: 11): 

5' C CA TAT G CA CCA GTA ATA GCC AAT AGT GC 3' 
15 Nde I 

PCR product was purified, ligated into pGEM-T and sequenced as 

described above. One clone was used and designated as pGEM-PIN-A. 

Fusion of genes or gene fragments with ANG molecule 

pJANGACys7Cys236 vector was cut with Ndel and Hindm restriction 
20 enzymes as all the other pGEM clones and purified on QIAquick columns. 

Ligation was performed at 14°C overnight in 10 pi of solution containing T4 

DNA ligase and inserfcvector at about 20:1 molar ratio. One ul of ligation mix 

was used for transformation of E. coli competent cells and spread onto LB 

plate with kanamycin. Three fusion gene containing colonies were picked up 
25 from each transformation for plasmid preparation and sequencing. The 

clones were called, for example, pJANG/OHBD/Cys7Cys236 in case of the 

oleosin binding domain containing ANG molecule, 

The fused genes were subcloned into pET-lld expression vector 

between Ncol and BamHI sites. These clones were called, for example, 
30 pET-ANG/OHBD/Cys7Cys236. 



WO 00/02914 PCT/AU99/00563 



Sequence of PCR primers for detect! n of genes for 
ANG/domain/Cys7Cys236 in recombinant wheat 

The primers used for the detection of genes for 
ANG/domain/Cys7Cys236 in recombinant wheat are set out below. 

Primer Length Sequence Designation 

Bxl7_3' 23 CAACCATGTCCTGAACCTTCACC SEQIDNO:12 

RGS2024 18 TGGCTGTTGAGGTTGCAC SEQ ID NO: 13 



Expression of fusion proteins 

For protein expression one of the pET clones were transformed into E. 
coli strain AD494(DE3) one day before expression work started. 

10 Small scale expression was carried out in 5 ml 2YT medium, 

supplemented with ampicillin (100 mg/1) using one transformant colony. 
After about 5 hours from inoculation (OD600=0.4), expression of the protein 
was induced by addition of 0.4 mM isopropyl-0 -D-thiogalactopyranoside 
(IPTG). Both induced and uninduced cultures were further incubated for 4 

15 hours at 37°C. Expression was monitored by SDS-PAGE according to 
Laemmli (1970). 

Large scale expression was performed in a shaking flask. One litre 2YT 
medium was inoculated with 1 ml of overnight culture and induced to 
express protein by addition of 0.4 mM IPTG at a cell density of —0.6 Ab. The 
20 culture was incubated with shaking overnight and cells were harvested by 
centrifugation. 
Detection of proteins 

PAGE gels were stained overnight with 0.025% Coomassie Blue R-250 
in 10% TCA. Excess stain was washed away by water-ethanol-acetic acid 
25 (8:1:1) solution. 

Immunological detection of the PIN-A containing fusion protein was 
carried out using the method of Ciaffi et al., (1999). The antibody was raised 
against puroindoline crude extract, kindly provided by Sadequr Rahman. 
Other chimeric proteins were detected in immunoblots using an 
30 antibody raised against Hordein, kindly provided by John Skerritt. 
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Purification of the fusion proteins 

Expressed ANG/CMl6/Cys7Cys236, ANG/CMl7/Cys7Cys236 and 
ANG/SBD/Cys7Cys236 proteins were purified following the method 
published elsewhere (Tamas et al., 1998), except for the precipitation step. 
5 In this work 2 volumes of 1.5 M NaCl were mixed with the 70% ethanol 
extract, rather than 4 volumes of acetone. 

Detection of the gene encoding modified seed storage proteins in 
transgenic wheat plants 



10 plants was determined by polymerase chain reaction (PCR). Reactions were 
carried out in 11.6 ul volume containing 9 ul PCR Supermix (GibcoBRL), 50 
ng template DNA (extracted from wheat leaf tissue using standard protocols), 
172 nmol of each of primers Bxl7_3' and RGS2024, and 0.6 ul of 25 mM 
MgCl 2 . The PCR conditions were 1 cycle of 94°C for 2 min; 35 cycles of 94°C 

15 for 30 sec, 60°C for 30 sec, 72°C for 1 min; 1 cycle of 72°C for 4 min, 25°C for 
l min. The results for one plant containing the gene for 
ANG/OHBD/Cys7Cys236 are shown in Figure 17. The PCR product of 
approximately 600 base pairs is indicative of the presence of the gene. A 600 
bp PCR product is not obtained in PCR reactions from negative control 

20 plants. 

RESULTS 

Design, construction and cloning of pJANGACys7Cys236 

The gene selected to construct this vector for fusion protein coded for 
C hordein. This molecule is a storage protein from barley endosperm and 

25 characterised by an absence of cysteine residues. The barley genomic clone 
encodes a molecule of 261 residues, including a 20 residue signal peptide. 
The gene for mature protein (molecular mass 28 kDa) has 723 nucleotides, 
including a 669 bp long fragment for a central repetitive domain. The 
oligonucleotides for PCR were designed to reduce the size of the central part 

30 and substitute one residue by cysteine in both unique terminal domains. 
Oligonucleotide 1 binds to the 5' end of the gene and has an additional 
sequence at the 5' end to incorporate an initiation ATG codon for methionine 
and a restriction site for BspHI. Oligonucleotide 1 has a T at position 22 to 
replace an A to change the codon of serine to cysteine at position 7. The 3' 

35 end of oligonucleotide 2 is complimentary to a sequence in the repetitive 

domain between 430 and 448 nucleotides. This is a rather unique sequence 



The presence of the gene for modified proteins in transgenic wheat 
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within the strongly repetitive region and also codes for an end of a repeat 
motif. The amplified gene contains only a 411 bp (137 amino acids) long 
fragment for central repetitive domain. This oligonucleotide also contains 
the whole sequence of the C-terminal unique region (6 amino acids) of C 

5 hordein and a restriction site for BamHI, immediately after the stop codon. 
To change the threonine residue at the position of 236 of the full size 
molecule (six residues from the C-terminal end) the oligonucleotide has a C 
at position 26 and an A at position 27 to substitute a G and a T, respectively. 
There is another base pair change in this primer at position 22 to replace C 

10 with A, which is a "wobble" base of the isoleucine codon. This substitution 
allowed the creation of restriction site for Ndel enzyme. Oligonucleotide 2 
has also got six nucleotides which are not part of the C hordein gene, coding 
for alanine and serine. These extra nucleotides were added to create one 
unique restriction site, close to Ndel, within the ANG gene. The two 

15 restriction sites are separated with 4 bp to give easy cleavage for both 
enzymes. The insertion of a gene for another molecule or fragment of a 
molecule between Hindlll and Ndel enabled the present inventors to create 
fusion proteins with new, designed characteristics. 

The gene for ANGACys7Cys236 molecule is 474 bp long and codes for 

20 a protein with a molecular mass of 18.5 kDa (Figure 3). 

PCR amplified DNA was cloned, with two steps, into a plasmid, called 
pJKKm with a modified MCS. Having deleted Hindin restriction site from 
the original plasmid, the cloning vehicle pJANGACys7Cys236 has unique 
Hindlll and Ndel cleavage sites within ANG gene for gene insertion. Size of 

25 the vector is 3873 bp and provides kanamycin resistance to host£. coli cells. 
The gene is sitting in the vector with 5' ends close to the SP6 RNA 
Polymerase transcription initiation site. 

Subcloning engineered genes into pET-lld did not require agarose gel 
purification of the fragments, because of the difference in resistance genes 

30 within plasmids. 

Expression of ANGACys7Cys236 protein 

To check and compare characteristics of this short molecule to 
hordeinCys7Cys236, it has been expressed in both small and large quantities. 
Comparison of the SDS-PAGE patterns of the total cell proteins before and 3 

35 hours after induction with IPTG showed a new band in the induced sample. 
ANG molecule was readily extracted from lysed cells with 70% (v/v) ethanol 
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and precipitated by the addition of 2 volume of 1.5 M NaCl solution. The 
resulting preparation, in the presence of reducing agent (0.1 M DTT), had the 
same mobility as the extra band in E. coli lysate. Apparent molecular mass of 
the protein was about 21 kDa, having a slightly lower mobility in the gel. 
5 This characteristic is not unusual for storage proteins. In the absence of 

DTT, the ethanol extracted sample gave a ladder of bands indicating that the 
protein is able to form long chains through disulfide bonds. 

The ability of the ANG molecule to incorporate into the gluten matrix 
of the dough was confirmed by a series of mixing experiments carried out 
10 using small scale testers. 

Results of SDS-PAGE and mixing experiments showed clearly that 
ANGACys7Cys236 protein had the same or similar properties as the 2 
cysteine residue containing C hordein protein. 

Design, construction and cloning of synthetic gene for oleosin hydrophobic 
15 binding domain (OHBD) 

Amino acid sequence was designed according to a comparison of four 
molecules (one maize, two barleys and one rice), using Genetic Computer 
Group (GCG) program, called "pileup". OHBD gene fragment, for this work, 
contain the entire region of the lipophilic stretch of oleosin (Figure 2), very 

20 similar to barley gene (accession number: X82677). Codon usage was 

designed to avoid long stretch of Gs and Cs, which could have led to mis- 
annealing and sequencing problems. The 5' flanking region of the gene 
fragment consisted of a tripeptide sequence, of the amphipathic N terminus, 
while the 3' end another tripeptide of the C terminus of oleosin. The central . 

25 anti-parallel beta stranded domain had 71 residues. The turn of the anti- 
parallel consists of 13 residues and is the most conserved region. The two 
anti-parallel strands are highly symmetrical in the pairing of residues of 
similar hydrophobicity on the opposite strands. These characteristics are 
very similar to those reported for maize oleosin protein (Huang 1996). 

30 The synthetic gene fragment, flanked with two appropriate restriction 

sites for subcloning, was cloned into pGEM-T vector. One clone, containing 
the correct sequence of a fragment of 243 bp nucleotides, was designated as 
pGEM-OHBD. Size of OHBD fragment was 77 amino acids and had a 
molecular mass of 7 kDa. 
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Cloning f starch binding domain (SBD) 

Glucoamylase 1 from Aspergillus niger comprises two domains, one 
being a catalytic domain (1-470 residues) and the other (509-616 residues) 
being responsible for binding granular starch (Le Gal-Coeffet et al., 1995). 
5 DNA corresponding to the SBD was amplified by PCR method from 

Aspergillus niger purified genomic DNA, cloned into pGEM-T vector, 
sequenced and designated as pGEM-SBD. This clone had a 337 bp long 
fragment (Figure 4) with two restriction sites for insertion into 
pJANGACys7Cys236 vector and an extra alanine before the very first cysteine 
10 of SBD. This residue derived from Hindm restriction site. The binding 

domain had 108 amino acids, including two cysteines, with a molecular mass 
of 11.9 kDa. 

Cloning of CM16 and CM17 gene for fusion 

Both CM16 and CM17 (CM refers to Chloroform/Me thanol soluble) 

15 proteins, reported as members of the a-amylase/trypsin inhibitor family and 
also reported that specific lipids are tightly bound to the fraction, were 
purified from wheat (Kobrehel and Sauvaire, 1990). These two molecules are 
very similar on the amino acid level, but there are a few differences in the 
distribution of charged residues (Figure 5). 

20 To clone these two genes, three primers were used to amplify them in 

two, separate PCR reactions from wheat genomic DNA. One primer 
(oligonucleotide 11) hybridised to the 3' end of the genes, while to 
distinguish between the two genes, two specific primers were designed for 
the 5' ends. pGEM-CMl7 clone carried the gene for only the mature protein. 

25 However, pGEM-CMl6 clone had a few extra base pairs from the signal 
peptide region. Both clones had a DNA fragment encoding mature 
chloroform/methanol soluble proteins with 10 cysteine residues and a 
molecular mass of 13.4 kDa. 

To subclone these genes into ANGACys7Cys236 carrying vector, two 

30 restriction sites were added to one of each ends by PCR, using the same 

primer pairs for both pGEM clones, as templates. Primer corresponding to 
the N-terminus of the mature proteins contained nucleotides for Hindm 
restriction enzyme. It also had an extra alanine residue, and a mutation in 
the first codon (TTA for leucine), substituting valine (GTC) in CM17 and 

35 isoleucine (ATC) in CM16 molecule, because of the sequence requirement for 
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Hindin enzyme. The fragment cloned into ANG cloning vehicle had 360 
nucleotides. 

Cloning puroindoline A (PIN-A) gene for fusion 

The clone for PIN-A protein, which is capable of binding tightly to 
5 both wheat phospholipids and glycolipids (Dubreil et al., 1997), was kindly 
provided by S. Rahman, derived from a genomic library. 

PCR amplified DNA encoded for a 121 amino acid long fragment of 
puroindoline A, 5 residues shorter in the N terminal region than the mature 
protein. This shorter protein had 10 cysteine residues and a molecular mass 
10 of 14.3 kDa (Figure 6). For insertion into ANG molecule, the pGEM-PIN-A 
clone also carried two restriction sites (Hindm and Ndel) and an extra 
alanine, because of Hindlll site requirement. 
Expression of ANG/OHBD/Cys7Cys236 protein 

Having inserted OHBD gene fragment into pJANGACys7Cys236 cloning 
15 vehicle, the entire gene was subcloned into expression vector pET-lld. 

Protein was expressed in E. coli, using AD494(DE3) strain. This genetically 
engineered new gene (717 bp long) coded for a protein with a molecular mass 
of 25.5 kDa. 

Cells were harvested by centrifugation after overnight expression (5 ml 

20 culture) and resuspended in 100 p\ gel loading buffer (0.125 M TRIS/HG1 pH 
6.8, 4% SDS, 10% (v/v) glycerol), containing 0.1 M DTT. Proteins were 
analysed by SDS-PAGE and stained with Coomassie blue (Figure 7). Lane A 
contains standard molecular weight markers, lane B shows a control in which 
the host E. coli cell contains the pET-lld vector, and lane C shows the 

25 expression of the recombinant ANG/OHBD/Cys7Cys236 protein (migrating 
slightly further than the 30 kDa marker) from pET-lld containing the 
ANG/OHBD/Cys7Cys236 gene. The apparent molecular mass of the newly 
synthesised protein (—29 kDa) was larger than that calculated on the basis of 
the nucleotide sequence of DNA. A similar characteristic has been observed 

30 previously for different prolamines of cereals. This fusion protein consisted 
of part of a prolamine and a part of an oleosin molecule. C hordein had been 
considered as a rod shaped molecule, while oleosin hydrophobic binding 
domain has anti-parallel strands, which would penetrate into the oil body 
and thus anchor the protein stably. It means that none of the two fragments 

35 has globular structure as compared with the molecular weight marker 
molecules. Discrepancy in apparent molecular mass on SDS-PAGE was 
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probably due to these two structural phenomena and it was manifested in 
lower mobility. Figure 8 shows a Western blot of an SDS-PAGE gel (run in 
the absence of reducing agents such as dithiothreitol) using antibodies to the 
C-hordein protein. Lane A contained standard molecular weight markers, 
lane B contained extract of an E. coli cell expressing a modified C-hordem 
gene containing a single cysteine in the N-terminal region, and lane C 
contained extract from* coli containing the ANG/OHBD/C y s7Cys236 gene in 
the pET-Ud expression plasmid. The strong antibody cross reaction to 
hordein antibodies at 29 kDa confirms that the 29 kDa protein is the product 
of the ANG/OHBD/Cys7Cys236 gene. 

Expression and analysis of ANG/PIN-A/Cys7Cys236 protein 

A DNA fragment comprised of ANGACys7Cys236 1 and Puroindohne-A 
genes with a length of 840 nucleotides from kanamycin resistant plasmid (see 
above) was subcloned into pET-lld expression vector. One transformant 
colony was picked up and transferred into 5 ml of 2YT medium. Protein 
expression was induced by IPTG and after 2 and 6 hours cells were harvested 
by centrifugation and resuspended in 200 ul of gel loading buffer. SDS-PAGE 
analysis of protein content of bacteria, before and after induction, is shown 
on Figure 10. Lane A contained standard protein molecular mass markers. 
Lane B contained extract of cells harbouring the control pET-lld plasmid, 
lanes C and D contained extracts of cells harbouring the pET-lld vector 
containing the ANG/PIN-A/Cys7Cys236 gene. Lane C contained extract 2 
hours after IPTG induction, lane D contained extract prepared 6 hours 
following IPTG induction. An extra protein band appeared on gel with an 
apparent molecular mass of 35 kDa, after induction (lane D). This expressed 
protein positively reacted with puroindoline A antibody (Figure 11). Lanes A 
and B contained extract of cells harbouring pET-lld containing the 
ANG/PIN-A/Cys7Cys236 gene, 2 hours and 6 hours after induction 
respectively. Lane C contained a Western blot of cells harbouring a plasmid 
containing the puroindoline A gene alone (not inserted into the 
ANGACys7Cys236 gene), and lane D contained extract of cells harbouring the 
control pET-lld vector. All lanes were reacted with anti- puroindoline A 
antibody. Figure 12 shows a Western blot of an SDS-PAGE gel reacted with 
anti-hordein antibodies. Lane A contained molecular mass markers, lane B 
contained extract of cells harbouring the control pET-lld vector, lane C 
contained extract of cells harbouring a plasmid which contains the 
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puroindoline A gene alone, and lane D contained extract of cells harbouring 
pET-lld vector containing the ANG/PIN-A/Cys7Cys236 gene. The expressed 
protein is clearly seen in lane D. 

Analogue glutenin-puroindoline A fusion protein had a slightly lower 

5 mobility on SDS-PAGE as it was calculated (32.8 kDa) on the basis of amino 
acid sequence. This small difference was probably due to the rod shape of 
the ANG part of the molecule. Expressed puroindoline A protein, however, 
had the same mobility as it was calculated. 
Expression of modified proteins following wheat transformation 

10 Microprojectile bombardment is currently the most widely applied 

technique used to transfer genes into wheat. The transfer of genes into wheat 
and their expression is carried out using specific DNA constructs containing 
a selectable marker gene and the gene-of-interest respectively. The selectable 
marker gene used was contained on the plasmid pEmuKON (Chamberlain et 

15 al., 1994). Constructs for the gene-of-interest constructs are shown in Figure 
13. Wheat transformation was carried out using paromomycin selection 
according to the procedures outlined in Witrzens et al., 1998). 
Expression, purification and analysis of ANG/SBD/Cys7Cys236 protein 



20 ANG/ SBD/Cys7Cys236 protein (pET-SBD) were separately transformed into 
strain AD494(DE3). One transformant colony of each was used to inoculate 5 
ml cultures of LB media containing 200 mg/1 ampicillin. After 4 hours 
shaking at 37°C these cultures were used to inoculate 600 ml of the same 
media in 2 1 flasks, and these were incubated with shaking overnight (16 

25 hours) at 37°C. IPTG was added to these cultures 6 hours after inoculation to 
induce expression of protein from the plasmids. Cells from each culture 
were harvested by centrifugation at 10,000 g for 15 min at 5°C. 

One half of the cells from each culture were lysed by incubation for 30 
min at room temperature in 35 ml of 50 mM Tris-Cl buffer pH 7.5 containing 

30 8 M urea, 5 mM EDTA and 1 mM DTT. Insoluble material was removed by 
centrifugation at 2xl0 5 g for 1 hr at 15°C. The supernatants were then 
extensively dialysed against the extraction buffer minus urea (50 mM Tris-Cl 
pH 7.5, 5 mM EDTA, 1 mM DTT) at 5°C. The cloudy precipitates were 
removed by centrifugation at 2xl0 5 g for 1 hr at 5°C and the supernatants (35 

35 mL) stored at -20°C. 



PETlld and the derivative of that plasmid containing the gene for 
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P-Cyclodextrin-binding proteins were purified from 20 ml of each of 
the two dialysed and clarified bacterial extracts by passage through a 5 ml p- 
cyclodextrin-sepharose column equilibrated with 50 mM Tris-Cl pH 7.5, 100 
mM NaCl, 40 mM dithiothreitol, 1 mM EDTA (column buffer). After loading, 
5 the column was washed with 20 ml column buffer and bound proteins eluted 
with 25 ml column buffer containing 15 mM p-cyclodextrin. Five ml 
fractions were collected during loading, washing and elution of the column. 
Fractions were dialysed extensively against water; lyophilised and 
resuspended in 0.1 M Tris-Cl pH 6.8 at ca. 1 (A per ml equivalent of the 

10 original bacterial cell culture. 

Native- and SDS-PAGE gels of the final load fraction and the wash and 
elution fractions are shown in Figure 14. Each lane contained protein from 
the equivalent of 7 ml of the bacterial culture. Two major p-cyclodextrin- 
binding proteins were seen in the elution fractions (see lanes 6 and 7) from 

15 the strain bearing plasmid pET-SBD. These are estimated to be 30 kDa and 
>60 kDa from the SDS-PAGE gel. A 30 kDa protein was also eluted from the 
column of the extract from the control strain bearing plasmid pETlld (panel 
C, lane 6), but it is not produced to the same level as in the former strain. 
Expression, purification and analysis of ANG/CM16/Cys7Cys236 protein 

20 The gene for this fusion protein (834 bp long) was subcloned, from 

kanamycin resistant, pJKKm originated plasmid, into the ampicillin resistant 
pET-lld expression vector, as above. Bacteria was grown at 37°C until 
OD600 reached 0.6 units, then expression was induced by adding B?TG. 
Cells were harvested two, four or six hours after induction and samples were 

25 prepared as in case of SBD chimeric protein. 

Result of the SDS-PAGE analysis is shown in Figure 9 (the gel is 
stained with Coomassie protein stain). Lane A contains standard protein 
molecular weight markers. Lanes B to E show ethanol-soluble extracts of the 
crude E. coli lysates. Lane B contained extract from cells containing the 

30 control plasmid, pET-lld. Lanes C, D and E contained ethanol soluble 
extracts of cells harbouring the pET-lld vector containing the 
ANG/CMl6/Cys7Cys236 gene, prepared from cells 2, 4 and 6 hours 
respectively after induction of protein synthesis using D?TG. The ethanol- 
soluble fraction gave only one band on Coomassie stained gel representing a 

35 protein with an apparent molecular mass of 35 kDa. This was also higher 
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than the calculated (31.9 kDa), and this discrepancy is also explained by the 
unusual shape of the protein. 

Yield of protein expression in large scale, in shaking flask was 30 mg 
protein from 1 litre of medium. 
5 Demonstration of Starch Binding Activity 

The starch-binding activities of the major proteins in fractions 6 and 7 
of the affinity purification column were demonstrated as follows. Fractions 
were loaded onto four native-PAGE gels containing glycerol (10% w/v), 
CHAPS (0.05% w/v) and various combinations of p-cyclodextrin (5 mM) and 

10 starch (corn amylopectin, 0.1% w/v), as indicated in Figure 18. The mobility 
of the protein bands in each gel was compared. It was found that the 
electrophoretic mobility of the major protein band in each fraction was 
significantly retarded in the presence of starch (Figure 18, panel B). The 
specificity of the starch binding activity of these protein bands was 

15 demonstrated by its competitive inhibition by P-cyclodextrin (panels C, D). 
Identification of the purified starch-binding proteins by tryptic-peptide 
mass fingerprinting and N-terminal protein sequencing 

The two major protein bands from the native gel of (J-cyclodextrin 
affinity column fractions from strain AD494(DE3)/pET-SBD were subjected to 

20 in-gel tryptic digestion followed by matrix-assisted laser-desorption time-of- 
flight (MALDI-TOF) mass spectroscopy. The masses of the identified tryptic 
fragments were compared to the theoretical masses of tryptic peptides 
expected from ANG/SBD/Cys7Cys236, as well as being used to search the 
protein sequence databases for other candidate proteins giving matching 

25 tryptic peptides. 

The 30 kDa protein of fraction 6 (Figure 14, panels A, B, lane 6) was 
identified as the periplasmic maltose-binding protein ofE. coli (product of 
the malE gene), based on 17 peptide matches covering 57% of the protein 
sequence. 

30 The major protein band of fraction 7 (Figure 14, panels A, B, lane 7) 

was identified as ANG/SBD/Cys7Cys236 on the basis of 4 matching peptides 
covering 23.11% of the sequence (Table 1). Further proof of the identity of 
this protein was obtained from N-terminal protein sequencing. Ten cycles of 
sequencing yielded the sequence: 

35 MRQLNPCSQE 




which is as expected for ANG/SBD/Cys7Cys236. This provides convincing 
proof of the identity of the purified protein, and confirms that the functional 
data are properties of the modified seed storage protein, as designed. 

5 Table 1. Mass spectral characterisation of ANG/SBD/Cys7Cys236 



Peptide ma 


is Expected 
mass 


Start 


End 


Sequence 


1776.74 


1777.28 


224 


238 
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2193.00 


2193.85 


221 


238 
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2652.26 


2653.54 


198 


220 
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2257.03 


2258.00 


239 


258 


EYTVPQAC * GTSTATVTDTWR 



C* = acrylamide adduct of cysteine 

10 Demonstration of the ability of ANG/SBD/Cys7Cys236 to form disulphide- 
bonded polymers 

The design of the analogue glutenin protein (ANG) included cysteine 
residues at both N- and C-termini to allow disulphide-mediated incorporation 
of the modified seed-storage protein into the gluten macropolymer. The 
15 present inventors have shown by in vitro polymerisation experiments that the 
ability of the ANG/domain/Cys7Cys236 proteins to incorporate into 
disulphide-bonded polymeric species is not impaired by the presence of the 
extra domain. 

The ability to form polymers was demonstrated by copper-mediated 
20 oxidation of ANG/SBD/Cys7Cys236 protein in 50 mM Tris-Cl buffer pH 6.8. 
Three pairs of protein samples were employed, with one of each pair 
containing added p-cyclodextrin (0.5 mM) to analyse any effect of a ligand on 
the disulphide-bonding of the ANG. Two pairs were oxidised for 5 min at 
room temperature in the presence of 0.1 mM CuS0 4 , before chelation of the 
25 metal by addition of EDTA (10 mM). One of these pairs was then reduced in 
the presence of 20 mM DTT. A further pair of samples had no copper added, 
but were reduced with DTT in the presence of EDTA. The samples were 
analysed by SDS-PAGE (Figure 19). There was no effect on oxidation due to 
the presence of the ligand 3-cyclodextrin. The majority of the oxidised 
30 protein sample forms a high molecular weight band which just enters the 
separating gel (lanes 1, 2). That this change in mobility is due to the 
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formation of disulphide-bond polymeric forms is shown by the ability of the 
disulphide-specific reductant DTT (lanes 3, 4) to break the polymer back 
down into the higher-mobility form seen in the reduced sample (lanes 5, 6). 
Detection of the gene encoding modified seed storage proteins in transgenic 



The presence of the gene for modified proteins in transgenic wheat 
plants was determined by polymerase chain reaction (PCR). Reactions were 
carried out in 11.6 /a1 volume containing 9 n\ PCR Supermix (GibcoBRL), 50 
ng template DNA (extracted from wheat leaf tissue using standard protocols), 

10 172 nmol of each of primers Bxl7_3' and RGS2024, and 0.6 jil of 25 mM 

MgCl 2 . The PCR conditions were 1 cycle of 94°C for 2 min; 35 cycles of 94°C 
for 30 sec, 60°C for 30 sec, 72°C for 1 min; 1 cycle of 72°C for 4 min, 25°C for 
1 min. The results for one plant containing the gene for 
ANG/OHBD/Cys7Cys236 are shown in Figure 17. The PCR product of 

15 approximately 600 base pairs is indicative of the presence of the gene. A 600 
bp PCR product is not obtained in PCR reactions from negative control 
plants. 

Demonstration of the ability of ANG/SBD/Cys7Cys236 to form disulphide- 
bonded polymers 

20 The design of the analogue glutenin protein (ANG) included cysteine 

residues at both N- and C-termini to allow disulphide-mediated incorporation 
of the modified seed-storage protein into the gluten macropolymer. The 
present inventors have shown by in vitro polymerisation experiments that the 
ability of the ANG/domain/Cys7Cys236 proteins to incorporate into 

25 disulphide-bonded polymeric species is not impaired by the presence of the 
extra domain. 

The ability to form polymers was demonstrated by copper-mediated 
oxidation of the protein in 50 mM Tris-Cl buffer pH 6.8. Three pairs of 
protein samples were employed, with one of each pair containing added (J- 

30 cyclodextrin (0.5 mM) to analyse any effect of ligand -binding on the 

disulphide-bonding of the ANG moiety. Two pairs were oxidised for 5 min at 
room temperature in the presence of 0.1 mM CuS04, before chelation of the 
metal by addition of EDTA (10 mM). One of these pairs was then reduced in 
the presence of 20 mM DTT. A further pair of samples had no copper added, 

35 but were reduced with DTT in the presence of EDTA. The samples were 

analysed by SDS-PAGE (Figurel9). There was no effect on oxidation due to 



5 



wheat plants. 
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the presence of the ligand p-cyclodextrin. The majority of the oxidised 
protein sample forms a high molecular weight band which just enters the 
separating gel (lanes 1, 2). That this change in mobility is due to the 
formation of disulphide-bond polymeric forms is shown by the ability of the 
5 disulphide-specific reductant DTT (lanes 3, 4) to break the polymer back 

down into the higher-mobility form seen in the reduced samples (lanes 5, 6). 
SUMMARY 

The demonstration of the binding activity of a modified glutenin 
according to the present invention was provided in the example given above 

10 for what has been termed the ANG/SBD/Cys7Cys236 protein. The data 

provided supports the demonstration of the expression and purification of a 
novel protein (apparent molecular mass 60 kDa) from the E. coli strain 
bearing the expression plasmid (pET-SBD), in comparison to the lack of this 
protein in the same strain bearing the control plasmid (pETlld) (Figure 14). 

15 The purification involved the use of an affinity chromatography step in 

which the protein was purified on the basis of its ability- to bind to a column 
matrix containing (3-cyclodextrin. The protein was specifically eluted from 
this column using unbound p-cyclodextrin. This, therefore, represents the 
first example of how the present inventors demonstrate reduction to practice 

20 through the generation of a seed storage protein containing a functional 
ligand binding domain. 

The demonstration of the binding of the major purified proteins to corn 
starch using native polyacrylamide gel electrophoresis, and the inhibition of 
this binding by P-cyclodextrin (PAGE) (Figurel8) is a highly convincing 

25 demonstration of reduction to practice of the present invention. 

The identification of this protein as ANG/SBD/Cys7Cys236 by peptide 
mass fingerprinting and N-terminal protein sequencing further demonstrates 
the functionality of modified seed-storage proteins containing a 
macromolecular binding domain. The anomolous apparent molecular weight 

30 of this protein in SDS-PAGE may be yet another example of the general 
phenomenon of modified C hordein proteins migrating with unexpected 
electrophoretic mobility, as the calculated mass of the protein is only 30 kDa. 
Alternatively, the protein may form a dimer, perhaps around a single bound 
b-cyclodextrin ligand, which is not dissociated by boiling in the presence of 

35 SDS and a reducing agent. 
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The ANG portion forms disulphide-bonded polymeric species was 
demonstrated by native-PAGE of the copper-oxidised affinity purified 
proteins. The fusion protein has been demonstrated to contain the novel 
starch binding activity. Importantly, it is believed that the modified protein 
5 has not lost the ability to polymerise through the cysteine residues at the N- 
and C-termini and therefore the novel protein can be incorporated into the 
gluten macropolymer in wheat flour. 

Demonstration of the capacity of the modified seed-storage protein 
with macromolecule-binding domain (in this instance 
10 ANG/SBD/Cys7Cys236) to form disulphide-bonded polymeric species (Figure 
19). This demonstrates the ability of the protein to be incorporated into the 
gluten macropolymer through disulphide bonds. 

Table 1 demonstrates the identity between the molecular weights of 
the experimentally determined tryptic fragments of the major 3-cyclodextrin- 
15 binding protein (starch-binding protein) purified from the 

ANG/SBD/Cys7Cys236 strain and the theoretical masses of the tryptic 
fragments of the ANG/SBD/Cys7Cys236 protein. The identification of the 
30kDa protein as an endogenous bacterial maltose-binding protein is also 
provided. 

20 It will be appreciated by persons skilled in the art that numerous 

variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are, therefore, to 
be considered in all respects as illustrative and not restrictive. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) NAME: Commonwealth Scientific and Industrial Research 



Organisation 



STREET : Limestone Avenue 

(C) CITY: Campbell 

(D) STATE: ACT 

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP): 2601 



(A) NAME: Goodman Fielder Limited 

(B) STREET: Level 42 Grosvenor Place 

(C) CITY: Sydney 

(D) STATE: NSW 

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP): 2000 

(A) NAME: Groupe Limagrain Pacific Pty Ltd 

(B) STREET: Level 31, 1 O'Connell Street 

(C) CITY: Sydney 

(D) STATE: NSW 

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP): 2000 



(ii) TITLE OF INVENTION: Modified Proteins 



(iii) NUMBER OF SEQUENCES: 2 6 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
■(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GTCATGAGGC AACTAAACCC TTGCAGCCAA GAGTTGCAAT C 41 



(2) INFORMATION FOR SEQ ID NO: 2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGATCCCTAG ACCATACTCC ATATGCATGA AGCTTGTTGG GGGACTGGTT G 51 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CAAGCTTGTA CCACTCCCAC CGCC 24 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCATATGCAC CGCCAGGTGT CAGTCAC 27 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE' TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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GTCGGCAATG AAGATTGCAC C 



21 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCCAACTGCG TTCTCCTCTT GGCC 2 4 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGATCCCTAG CTCCACTGAG ACTC 24 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TGCGCTCAAG CTTTAGGCAA TGAAGATTGC ACC 33 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CATACTCCAT ATGCAGCTCC ACTGAGACTC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAAGCTTACG ATGTTGCTGG CGGG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCATATGCAC CAGTAATAGC CAATAGTGC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAACCATGTC CTGAACCTTC ACC 



(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGGCTGTTGA GGTTGCAC 18 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATAGAATACA GCATGCTCCC GGCCGCCATG GCCGCGGGAT TGTCATGAGG CAACTAAACC 60 
CTTGCAGCGT CCCCCAACAA GCTTCATGCA TATGGAGTAT GGTCTAGGGA TCCGGGTACC 12 0 
GAGCTCGAAT TCGCCCTATA 140 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Arg Gin Leu Asn Pro Cys Ser Val Pro Gin Gin Ala Ser Cys lie 
15 10 15 

Trp Ser Met Val 



20 



(2) INFORMATION FOR SEQ ID NO : 16: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 
CAAGCTTTGA AAGCCGCTAC TGCGACAGCA GCTGGATCAT TGCTTGTGCT ATCCGGACTA 60 
ATACTAGCTG GCACAGTCAT AGCACTCACA GTGGCCACAC CAGTGCTAGT CATATTTAGC 120 
CCAGTGCTAG TGCCAGCGGC CATAGCCCTA GCGCTAATGT CAGCAGGCTT TGTCACGTCA 180 
GGCGGGCTGG GCGTGGCTGC GCTGAGCTCC TTTAGTGTGT TAGCCAATAC TGCCTGCATA 240 
TGG 243 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gin Ala Leu Lys Ala Ala Thr Ala Thr Ala Ala Gly Ser Leu Leu Val 
15 10 15 

Leu Ser Gly Leu lie Leu Ala Gly Thr Val He Ala Leu Thr Val Ala 
20 25 30 

Thr Pro Val Leu Val He Phe Ser Pro Val Leu Val Pro Ala Ala He 
35 40 45 

Ala Leu Ala Leu Met Ser Ala Gly Phe Val Thr Ser Gly Gly Leu Gly 

50 55 60 

Val Ala Ala Leu Ser Ser Phe Ser Val Leu Ala Asn Thr Ala Cys He 
65 70 75 80 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATGAGGCAAC TAAACCCTTG CAGCCAAGAG TTGCAATCAC CACAACAATC ATATCTGCCG 60 
CAGCCATATC CACAAAACCC ATATCTACCG CAAAAACCAT TTCCAGTGCA GCAACCGTTT 120 
CACACACCCC AACAATATTT CCCCTATCTA CCAGAGGAAT TGTTTCCCCA ATATCAAATA 180 
CCAACCCCCC TACAACCACA ACAACCATTC CCCCAACAAC CACAACAACC TCTTCCTCGG 24 0 
CCCCAACAAC CATTCCCCTG GCAACCACAA CAACCATTTC CCCAGCCCCA AGAACCAATT 300 
CCCCAACAAC CATTCCCCTG GCAACCACAA CAACCATTTC CCCAGCCCCA AGAACCAATT 360 
CAACAAATAA TTTTCCAGCA ACCCCAACAA TCATACCCTG TGCAACCTCA ACAGCCATTT 420 
CCTCAACAAC CTCAACCAGT CCCCCAACAA GCTTCATGCA TATGGAGTAT GGTCTAG 477 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Arg Gin Leu Asn Pro Cys Ser Gin Glu Leu Gin Ser Pro Gin Gin 
15 10 15 

Ser Tyr Leu Gin Gin Pro Tyr Pro Gin Asn Pro Tyr Leu Pro Gin Lys 
20 25 30 

Pro Phe Pro Val Gin Gin Pro Phe His Thr Pro Gin Gin Tyr Phe Pro 
35 40 45 

Tyr Leu Pro Glu Glu Leu Phe Pro Gin Tyr Gin lie Pro Thr Pro Leu 
50 55 60 

Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro Gin Gin Pro Leu Pro Arg 
65 70 75 80 

Pro Gin Gin Pro Phe Pro Trp Gin Pro Gin Gin Pro Phe Pro Gin Pro 
85 90 95 
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Gin Glu Pro lie Pro Gin Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro 
100 105 110 

Gin Gin Pro Phe Pro Gin Gin Pro Gin Gin lie He Phe Gin Gin Pro 
115 120 125 

Gin Gin Ser Tyr Pro Val Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro 
130 135 140 

Gin Pro Val Pro Gin Gin Ala Ser Cys lie Trp Ser Met Val 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AAGCTTCTAC CACTCCCACC GCCGTGGCTG TGACTTTCGA TCTGACAGCT ACCACCACCT 60 

ACGGCGAGAA CATCTACCTG GTCGGATCGA TCTCTCAGCT GGGTGACTGG GAAACCAGCG 120 

ACGGCATAGC TCTGAGTGCT GACAAGTACA CTTCCAGCGA CCCGCTCTGG TATGTCACTG 180 

TGACTCTGCC GGCTGGTGAG TCGTTTGAGT ACAAGTTTAT CCGCATTGAG AGCGATGACT 240 

CCGTGGAGTG GGAGAGTGAT CCCAACCGAG AATACACCGT TCCTCAGGCG TGCGGAACGT 300 

CGACCGCGAC GGTGACTGAC ACCTGGCGGT GCATATGG 338 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ala Ser Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp Leu Thr Ala 
1 5 10 15 

Thr Thr Thr Tyr Gly Glu Asn He Tyr Leu Val Gly Ser He Ser Gin 



WO 00/02914 



PCT/AU99/00563 



Leu Gly Asp Trp Glu Thr Ser Asp Gly He Ala Leu Ser Ala Asp Lys 
35 40 45 

Tyr Thr Ser Ser Asp Pro Leu Trp Tyr Val Thr Val Thr Leu Pro Ala 
50 55 60 

Gly Glu Ser Phe Glu Tyr Lys Phe He Arg He Glu Ser Asp Asp Ser 
65 70 75 80 

Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val Pro Gin Ala 
85 90 95 

Cys Gly Thr Ser Thr Ala Thr Val Thr Asp Thr Trp Arg Cys He Trp 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AAGCTTTCGG CAATGAAGAT TGCACCCCAT GGATGAGTAC TCTGATCACT CCACTCCCAA 60 
GCTGCCGTGA CTATGTGGAA CAACAAGCAT GTCGCATCGA AACGCCCGGG TCGCCGTACC 120 
TCGCCAAGCA GCAGTGCTGT GGGGAGCTTG CAAACATTCC GCAGCAGTGC CGATGCCAGG 180 
CGCTGCGCTA CTTCATGGGG CCGAAGTCTC GTCCGGATCA GAGCGGCCTC ATGGAACTCC 24 0 
CCGGATGCCC TAGGGAGGTG CAGATGGACT TCGTGAGGAT ACTCGTCACG CCGGGGTACT 300 
GCAACTTGAC GACCGTTCAC AACACTCCGT ACTGCCTCGC TATGGAGGAG TCTCAGTGGA 360 
GCTGCATATG G 371 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
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Ala lie Gly Asn Glu Asp Cys Thr Pro Trp Met Ser Thr Leu lie Thr 
15 10 15 

Pro Leu Pro Ser Cys Arg Asp Tyr Val Glu Gin Gin Ala Cys Arg He 
20 25 30 

Glu Thr Pro Gly Ser Pro Tyr Leu Ala Lys Gin Gin Cys Cys Gly Glu 
40 45 



35 



Leu Ala Asn lie Pro Gin Gin Cys Arg Cys Gin Ala Leu Arg Tyr Phe 
55 s0 



Met Gly Pro Lys Ser Arg Pro Asp Gin Ser Gly Leu Met Glu Leu Pro 



70 



Gly cys Pro Arg Glu Val Gin Met Asp Phe Val Arg He Leu Val Thr 
J 90 95 



Pro Glv Tyr Cys Asn Leu Thr Thr Val His Asn Thr Pro Tyr Cys Leu 
105 110 



100 

Ala Met Glu Glu Ser Gin Trp Ser Cys He Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ala lie Gly Asn Glu Asp Cys Thr Pro Trp Thr Ser Thr Leu lie Thr 
15 10 15 

Pro Leu Pro Ser Cys Arg Asn Tyr Val Glu Glu Gin Ala Cys Arg He 
20 25 30 

Glu Met Pro Gly Pro Pro Tyr Leu Ala Lys Gin Glu Cys Cys Glu Gin 
35 40 45 

Leu Ala Asn lie Pro Gin Gin Cys Arg Cys Gin Ala Leu Arg Tyr Phe 
50 55 60 

Met Gly Pro Lys Ser Arg Pro Asp Gin Ser Gly Leu Met Glu Leu Pro 
65 7 0 75 BU 

Gly Cys Pro Arg Glu Val Gin Met Asn Phe Val Pro He Leu Val Thr 
85 90 95 




Pro Gly Tyr Cys Asn Leu Thr Thr Val His Asn Thr Pro Tyr Cys Leu 
100 105 110 

Gly Met Glu Glu Ser Gin Trp Ser Cys lie Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AAGCTTACGA TGTTGCTGGC GGGGGTGGTG CTCAACAATG CCCTGTAGAG ACAAAGCTAA 60 
ATTCATGCAG GAATTACCTG CTAGATCGAT GCTCAACGAT GAAGGATTTC CCGGTCACCT 120 
GGCGTTGGTG GAAATGGTGG AAGGGAGGTT GTCAAGAGCT CCTTGGGGAG TGTTGCAGTC 180 
GGCTCGGCCA AATGCCACCG CAATGCCGCT GCAACATCAT CCAGGGGTCA ATCCAAGGCG 240 
ATCTCGGTGG CATCTTCGGA TTTCAGCGTG ATCGGGCAAG CAAAGTGATA CAAGAAGCCA 300 
AGAACCTGCC GCCCAGGTGC AACCAGGGCC CTCCCTGCAA CATCCCCGGC ACTATTGGCT 360 
ATTACTGGTG CATATGG 377 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ala Tyr Asp Val Ala Gly Gly Gly Gly Ala Gin Gin Cys Pro Val Glu 
1. 5 10 15 

Thr Lys Leu Asn Ser Cys Arg Asn Tyr Leu Leu Asp Arg Cys Ser Thr 

20 25 30 

Met Lys Asp Phe Pro Val Thr Trp Arg Trp Trp Lys Trp Trp Lys Gly 
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Gly Cys Gin Glu Leu Leu Gly Glu Cys Cys Ser Arg Leu Gly Gin Met 
50 55 60 

Pro Pro Gin Cys Arg Cys Asn lie lie Gin Gly Ser lie Gin Gly Asp 
65 70 75 80 

Leu Gly Gly lie Phe Gly Phe Gin Arg Asp Arg Ala Ser Lys Val lie 
85 90 95 

Gin Glu Ala Lys Asn Leu Pro Pro Arg Cys Asn Gin Gly Pro Pro Cys 
100 105 110 

Asn lie Pro Gly Thr lie Gly Tyr Tyr Trp Cys lie Trp 



115 



120 



125 
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CLAIMS: 

1. A method of producing a modified glutenin or seed-storage protein, the 
method comprising adding to the protein a domain which confers to the 
modified protein the ability to incorporate into gluten or bind a ligand or 

5 other macromolecule. 

2. The method according to claim 1 wherein the domain comprises one or 
more amino acid residues. 

3. The method according to claim 2 wherein the one or more amino acid 
residues comprise one or more cysteine residues. 

10 4. The method according to claim 3 wherein the one or more cysteine 

residues are incorporated at one or both ends of the amino acid sequence of 
the protein. 

5. The method according to claim 1 wherein the domain is a binding 
domain that confers to the modified protein the ability to bind a ligand or 

15 other macromolecule. 

6. The method according to claim 5 wherein the modified protein further 
contains one or more amino acid residues which confer to the protein the 
ability to incorporate into gluten. 

7. The method according to claim 5 or 6 wherein the binding domain is 
20 capable of binding lipids or starches. 

8. The method according to claim 7 wherein the lipid-binding domain is 
derived from barley oleosin gene or the lipid-binding regions of wheat CM16 
protein. 

9. The method according to claim 7 wherein the starch-binding domain is 
25 derived from glucoamylase from Aspergillus niger. 

10. The method according to any one of claims 1 to 9 wherein the glutenin 
or seed-storage protein is selected from the group consisting of low molecular 
weight glutenins, high molecular weight glutenins, gliadins, puroindolines, 
grain softness proteins, friabilins, and Chloroform/Methanol-soluble proteins. 

30 11. The method according to claim 10 wherein the glutenin or seed-storage 
protein is C hordein from barley. 

12. A modified glutenin or seed-storage protein having a domain inserted 
therein which confers to the protein the ability to incorporate into gluten or 
bind a ligand or other macromolecule. 
35 13. A modified glutenin or seed-storage protein produced by the method 
according to any one of claims 1 to 11. 
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14. A modified glutenin or seed-storage protein selected from the group 
consisting of ANG/SBD/Cys7Cys236, ANG/OHBD/Cys7Cys236, and 
ANG/CMl6/Cys7Cys236, as hereinbefore defined. 

15. An isolated nucleic acid molecule encoding a modified glutenin or 
5 seed-storage protein according to any one of claims 11 to 14. 

16. A cell containing an isolated nucleic acid molecule according to claim 
15 such that on expression of the nucleic acid molecule, the cell produces 
the modified glutenin or seed-storage protein . 

17. The cell according to claim 16 selected from bacteria, yeast, plant, 
10 insect or mammal. 

18. The bacterial cell according to claim 17 being Escherichia coli. 

19. The yeast cell according to claim 17 being Pichia sp. or Saccharomyces 
cerevisiae. 

20. The plant cell according to claim 17 being a recombinant wheat cell. 
15 21. Use of a modified glutenin or seed-storage protein according to any one 

of claims 11 to 14 in the preparation of a food product. 

22. The use according to claim 21 wherein the food product is selected 
from the group consisting of leavened or unleavened breads, pasta, noodles, 
breakfast cereals, snack foods, cakes, pastries, and foods containing flour- 

20 based sauces. 

23. Use of a modified glutenin or seed-storage protein according to any one 
of claims 11 to 14 in the preparation of a non-food product. 

24. The use according to claim 23 wherein the non-food product is 
selected from the group consisting of films, coatings, adhesives, building 

25 materials, and packaging materials. 

25. Use of a grain or part of a grain containing a modified glutenin or seed- 
storage protein according to any one of claims 11 to 14 in the preparation of a 
food product. 
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